bn 1
Machine Learning and Control Theory
Bensoussan, Alain, Li, Yiqun, Nguyen, Dinh Phan Cao, Tran, Minh-Binh, Yam, Sheung Chi Phillip, Zhou, Xiang
We survey in this article the connections between Machine Learning and Control Theory. Control Theory provide useful concepts and tools for Machine Learning. Conversely Machine Learning can be used to solve large control problems. In the first part of the paper, we develop the connections between reinforcement learning and Markov Decision Processes, which are discrete time control problems. In the second part, we review the concept of supervised learning and the relation with static optimization. Deep learning which extends supervised learning, can be viewed as a control problem. In the third part, we present the links between stochastic gradient descent and mean-field theory. Conversely, in the fourth and fifth parts, we review machine learning approaches to stochastic control problems, and focus on the deterministic case, to explain, more easily, the numerical algorithms.
- Asia > China > Hong Kong (0.05)
- North America > United States > Texas > Dallas County > Dallas (0.04)
- Asia > Vietnam > Khánh Hòa Province > Nha Trang (0.04)
- Asia > Middle East > Jordan (0.04)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.55)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)
Convergence and Consistency of Regularized Boosting Algorithms with Stationary B-Mixing Observations
Lozano, Aurelie C., Kulkarni, Sanjeev R., Schapire, Robert E.
We study the statistical convergence and consistency of regularized Boosting methods, where the samples are not independent and identically distributed(i.i.d.) but come from empirical processes of stationary β-mixing sequences. Utilizing a technique that constructs a sequence of independent blocks close in distribution to the original samples, we prove the consistency of the composite classifiers resulting from a regularization achievedby restricting the 1-norm of the base classifiers' weights. When compared to the i.i.d.
- North America > United States > New York (0.05)
- North America > United States > New Jersey > Mercer County > Princeton (0.05)